A New Term Representation Method for Gender and Age Prediction

نویسندگان

چکیده

Author Profiling is a kind of text classification method that used for detecting the personality profiles such as age, gender, educational background, place origin, traits, native language, etc., authors by processing their written texts. Several applications like forensic analysis, security and marking are techniques author profiling finding basic details authors. The main problem in domain preparation suitable dataset predicting characteristics PAN one organization conducting competitions on various types shared tasks. In 2013, organizers presented task series continued this further years. They arranged different kinds datasets varieties languages. From 2013 onwards several researchers proposed solutions to predict features utilizing provided competitions. Researchers character based, lexical or word structural features, syntactic, content style based distinguishing author’s writing styles Most observed words phrases those most useful work, experiment conducted with important terms age group gender from competition datasets. Two 2014 2016 experiment. documents converted vector representation which format giving training machine learning algorithms. term document plays crucial role improve performance prediction.The Term Weight Measures (TWMs) purpose represent significance value representation. we developed new TWM representing TWM’s efficiency compared other existing TWMs. Machine Learning (ML) algorithms SVM (Support Vector Machine) RF (Random Forest) considered estimating accuracy approach. We recognized accomplished best accuracies prediction two Datasets.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Document Weighted Approach for Gender and Age Prediction Based on Term Weight Measure

Author profiling is a text classification technique, which is used to predict the profiles of unknown text by analyzing their writing styles. Author profiles are the characteristics of the authors like gender, age, nativity language, country and educational background. The existing approaches for Author Profiling suffered from problems like high dimensionality of features and fail to capture th...

متن کامل

A New IRIS Segmentation Method Based on Sparse Representation

Iris recognition is one of the most reliable methods for identification. In general, itconsists of image acquisition, iris segmentation, feature extraction and matching. Among them, iris segmentation has an important role on the performance of any iris recognition system. Eyes nonlinear movement, occlusion, and specular reflection are main challenges for any iris segmentation method. In thi...

متن کامل

A New IRIS Segmentation Method Based on Sparse Representation

Iris recognition is one of the most reliable methods for identification. In general, itconsists of image acquisition, iris segmentation, feature extraction and matching. Among them, iris segmentation has an important role on the performance of any iris recognition system. Eyes nonlinear movement, occlusion, and specular reflection are main challenges for any iris segmentation method. In thi...

متن کامل

A New Dictionary Construction Method in Sparse Representation Techniques for Target Detection in Hyperspectral Imagery

Hyperspectral data in Remote Sensing which have been gathered with efficient spectral resolution (about 10 nanometer) contain a plethora of spectral bands (roughly 200 bands). Since precious information about the spectral features of target materials can be extracted from these data, they have been used exclusively in hyperspectral target detection. One of the problem associated with the detect...

متن کامل

a critical discourse analysis on gender representation in the iranian english books 2, 3 and top notch 2a, 2b a critical discourse analysis perspective

هدف از انجام تحقیق حاضر، مطالعه و بررسی نقش زنان و مردان در دو مجموعه از کتابهای درسی انگلیسی در قالب تحلیل گفتمان انتقادی و همچنین، بررسی و آشکار ساختن اصول ایدوئولوژیکی مربوط به نقش زنان و مردان در این کتابها است. این دو مجموعه کتاب عبارتنداز: کتابهای انگلیسی سال دوم و سوم دبیرستانی ایران و کتابهای تاپ ناچ 2آ و 2ب. برای انجام این منظور، مدل تحلیل گفتمان انتقادی فرکلاف مورد استفاده قرار گرفته ...

15 صفحه اول

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal on Recent and Innovation Trends in Computing and Communication

سال: 2023

ISSN: ['2321-8169']

DOI: https://doi.org/10.17762/ijritcc.v11i5s.6633